Providing perspective-dependent views to video conference participants

ABSTRACT

During a video conference between a local endpoint and a remote endpoint, a display at the local endpoint may be configured to provide perspective-dependent views to local video conference participants. A local endpoint may receive a plurality of video streams and identify a first video stream that provides a first view of a remote participant and a second video stream that provides a second view of that participant taken concurrently from a different angle. A display at the local endpoint may display the first video stream at a first viewing angle that only allows the first view of the remote participant to be seen from a first region. The display may also concurrently display the second video stream at a second viewing angle that only allows the second view of the remote participant to be seen from a second region different than the first region.

TECHNICAL FIELD

The present disclosure relates generally to telecommunications.

BACKGROUND

A video conference may exchange audio and video streams betweenparticipants at remote locations. Video streams received from a remotesite may be displayed to local participants on one or more displays, andreceived audio streams may be played by speakers. However, a localdisplay may not fully convey the non-verbal clues (e.g., eye gaze,pointing) provided by a remote speaker because the display may show thesame view of the remote speaker to every local participant. For example,a remote speaker may look at the image of a local participant toindicate that the speaker is talking to that participant; however, thatparticipant may not see the speaker's eye gaze and, thus, may have torely on other clues in order to determine that the speaker is addressinghim or her.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the present invention and itsadvantages, reference is made to the following description taken inconjunction with the accompanying drawings in which:

FIG. 1 illustrates a communications system that includes two endpointsengaged in a video conference;

FIGS. 2A-2B illustrate endpoints that use cameras and multiple viewdisplay devices to concurrently provide local participants withperspective-dependent views of remote participants;

FIGS. 3A-3B illustrate a multiple view display device that employslenticular lenses to provide different views to different participants.

FIGS. 4A-4B illustrate example lenticular lens designs for use inmultiple view display devices;

FIG. 5 is a flowchart illustrating a method by which a first endpointsends video streams to a second endpoint so that the second endpoint mayconcurrently provide different local participants withperspective-dependent views of one or more remote participants; and

FIG. 6 is a flowchart illustrating a method by which a multiple viewdisplay device employing a lenticular lens array may provide differentviews to participants.

DESCRIPTION OF EXAMPLE EMBODIMENTS Overview

In particular embodiments, a method for providing perspective-dependentviews to video conference participants comprises receiving a pluralityof video streams at a local endpoint from a remote endpoint, where thelocal endpoint and the remote endpoint are participating in a videoconference involving a plurality of participants. A first video streamis identified from the plurality of video streams. The first videostream provides a first view of a remote participant at the remoteendpoint. A second video stream is identified from the plurality ofvideo streams. The second video stream provides a second view of theremote participant. The first view and the second view show images ofthe remote participant taken concurrently from different angles. Thefirst video stream is displayed on a display at a first viewing anglethat only allows a local participant to see the first view of the remoteparticipant from a first region. The second video stream is displayed onthe display at a second viewing angle that only allows the localparticipant to see the second view of the remote participant from asecond region, where the second region is different than the firstregion.

Description

FIG. 1 illustrates a communications system, indicated generally at 10,that includes two endpoints engaged in a video conference. Asillustrated, communications system 10 includes network 12 connectingendpoints 14 and a videoconference manager 16. While not illustrated,communications system 10 may also include any other suitable elements tofacilitate video conferences.

In general, during a video conference, a display at a local endpoint 14is configured to concurrently display multiple video streams of a remoteendpoint 14. These video streams may each include an image of the remoteendpoint 14 as seen from different angles or perspectives. By allowingeach video stream to be seen at a different angle, the local display mayprovide local participants with a perspective-dependent view of theremote site. By providing these perspective-dependent views, localparticipants may see and more easily interpret various non-verbalcommunications, such as eye contact and/or pointing, which may result ina more realistic video conferencing experience.

Network 12 interconnects the elements of communications system 10 andfacilitates video conferences between endpoints 14 in communicationssystem 10. While not illustrated, network 12 may include any suitabledevices to facilitate communications between endpoints 14,videoconference manager 16, and other elements in communications system10. Network 12 represents communication equipment including hardware andany appropriate controlling logic for interconnecting elements coupledto or within network 12. Network 12 may include a local area network(LAN), metropolitan area network (MAN), a wide area network (WAN), anyother public or private network, a local, regional, or globalcommunication network, an enterprise intranet, other suitable wirelineor wireless communication link, or any combination of any suitablenetwork. Network 12 may include any combination of gateways, routers,hubs, switches, access points, base stations, and any other hardware orsoftware implementing suitable protocols and communications.

Endpoints 14 represent telecommunications equipment that supportsparticipation in video conferences. A user of communications system 10may employ one of endpoints 14 in order to participate in a videoconference with another one of endpoints 14 or another device incommunications system 10. In particular embodiments, endpoints 14 aredeployed in conference rooms at geographically remote locations.Endpoints 14 may be used during a video conference to provideparticipants with a seamless video conferencing experience that aims toapproximate a face-to-face meeting. Each endpoint 14 may be designed totransmit and receive any suitable number of audio and/or video streamsconveying the sounds and/or images of participants at that endpoint 14.Endpoints 14 in communications system 10 may generate any suitablenumber of audio, video, and/or data streams and receive any suitablenumber of streams from other endpoints 14 participating in a videoconference. Moreover, endpoints 14 may include any suitable componentsand devices to establish and facilitate a video conference using anysuitable protocol techniques or methods. For example, Session InitiationProtocol (SIP) or H.323 may be used. Additionally, endpoints 14 maysupport and be inoperable with other video systems supporting otherstandards such as H.261, H.263, and/or H.264, as well as with pure audiotelephony devices. As illustrated, endpoints 14 include a controller 18,memory 20, network interface 22, microphones 24, speakers 26, cameras28, and displays 30. Also, while not illustrated, endpoints 14 mayinclude any other suitable video conferencing equipment, for example, aspeaker phone, a scanner for transmitting data, and a display forviewing transmitted data.

Controller 18 controls the operation and administration of endpoint 14.Controller 18 may process information and signals received from otherelements such as network interface 22, microphones 24, speakers 26,cameras 28, and displays 30. Controller 18 may include any suitablehardware, software, and/or logic. For example, controller 18 may be aprogrammable logic device, a microcontroller, a microprocessor, anysuitable processing device, or any combination of the preceding. Memory20 may store any data or logic used by controller 18 in providing videoconference functionality. In some embodiments, memory 20 may store all,some, or no data received by elements within its corresponding endpoint14 and data received from remote endpoints 14. Memory 20 may include anyform of volatile or non-volatile memory including, without limitation,magnetic media, optical media, random access memory (RAM), read-onlymemory (ROM), removable media, or any other suitable local or remotememory component. Network interface 22 may communicate information andsignals to and receive information and signals from network 12. Networkinterface 22 represents any port or connection, real or virtual,including any suitable hardware and/or software that allow endpoint 14to exchange information and signals with network 12, other endpoints 14,videoconference manager 16, and/or any other devices in communicationssystem 10.

Microphones 24 and speakers 26 generate and project audio streams duringa video conference. Microphones 24 provide for audio input from usersparticipating in the video conference. Microphones 24 may generate audiostreams from received sound waves. Speakers 26 may include any suitablehardware and/or software to facilitate receiving audio stream(s) andprojecting the received audio stream(s) so that they can be heard by thelocal participants. For example, speakers 26 may include high-fidelityspeakers. Endpoint 14 may contain any suitable number of microphones 24and speakers 26, and they may each be associated with any suitablenumber of participants.

Cameras 28 and displays 30 generate and project video streams during avideo conference. Cameras 28 may include any suitable hardware and/orsoftware to facilitate capturing an image of one or more localparticipants and the surrounding area as well as sending the image toremote participants. Each video signal may be transmitted as a separatevideo stream (e.g., each camera 28 transmits its own video stream). Inparticular embodiments, cameras 28 capture and transmit the image of oneor more users 30 as a high-definition video signal. Displays 30 mayinclude any suitable hardware and/or software to facilitate receivingvideo stream(s) and displaying the received video streams toparticipants. For example, displays 30 may include a notebook PC, a wallmounted monitor, a floor mounted monitor, or a free standing monitor. Inparticular embodiments, one or more of displays 30 are plasma displaydevices or liquid crystal display devices. Endpoint 14 may contain anysuitable number of cameras 28 and displays 30, and they may each beassociated with any suitable number of local participants.

While each endpoint 14 is depicted as a single element containing aparticular configuration and arrangement of modules, it should be notedthat this is a logical depiction, and the constituent components andtheir functionality may be performed by any suitable number, type, andconfiguration of devices. In the illustrated embodiment, communicationssystem 10 includes two endpoints 14 a, 14 b, but it is to be understoodthat communications system 10 may include any suitable number ofendpoints 14.

Videoconference manager 16 generally coordinates the initiation,maintenance, and termination of video conferences between endpoints 14.Video conference manager 16 may obtain information regarding scheduledvideo conferences and may reserve devices in network 12 for each ofthose conferences. In addition to reserving devices or resources priorto initiation of a video conference, videoconference manager may monitorthe progress of the video conference and may modify reservations asappropriate. Also, video conference manager 16 may be responsible forfreeing resources after a video conference is terminated. Although videoconference manager 16 has been illustrated and described as a singledevice connected to network 12, it is to be understood that itsfunctionality may be implemented by any suitable number of deviceslocated at one or more locations in communication system 10.

In an example operation, one of endpoints 14 a, 14 b initiates a videoconference with the other of endpoints 14 a, 14 b. The initiatingendpoint 14 may send a message to video conference manager 16 thatincludes details specifying the time of, endpoints 14 to participate in,and estimated duration of the desired video conference. Video conferencemanager 16 may then reserve resources in network 12 and may facilitatethe signaling required to initiate the video conference between endpoint14 a and endpoint 14 b. During the video conference, endpoints 14 a, 14b may exchange one or more audio streams, one or more video streams, andone or more data streams. In particular embodiments, endpoint 14 a maysend and receive the same number of video streams as endpoint 14 b. Incertain embodiments, each of endpoints 14 a, 14 b send and receive thesame number of audio streams and video streams. In some embodiments,endpoints 14 a, 14 b send and receive more video streams than audiostreams.

During the video conference, each endpoint 14 a, 14 b may generate andtransmit multiple video streams that provide differentperspective-dependent views to the other endpoint 14 a, 14 b. Forexample, endpoint 14 a may generate three video streams that eachprovide a perspective-dependent view of participants at endpoint 14 a.These may show the participants at endpoint 14 a from three differentangles, e.g., left, center, and right. After receiving these videostreams, endpoint 14 b may concurrently display these three videostreams on a display so that participants situated to the left of thedisplay view one of the video streams, while participants situateddirectly in front of the display view a second of the video streams.Likewise, participants situated to the right of the display may view thethird of the video streams. Accordingly, endpoint 14 b may displaydifferent perspective-dependent views of remote participants to localparticipants. By providing different images to different participants,local participants may be able to more easily interpret the meaning ofcertain nonverbal clues (e.g., eye gaze, pointing) while looking at atwo-dimensional image of a remote participant.

When the participants decide that the video conference should beterminated, endpoint 14 a or endpoint 14 b may send a message to videoconference manager 16, who may then un-reserve the reserved resources innetwork 12 and facilitate signaling to terminate the video conference.While this video conference has been described as occurring between twoendpoints—endpoint 14 a and endpoint 14 b—it is to be understood thatany suitable number of endpoints 14 at any suitable locations may beinvolved in a video conference.

An example of a communications system with two endpoints engaged in avideo conference has been described. This example is provided to explaina particular embodiment and is not intended to be all inclusive. Whilesystem 10 is depicted as containing a certain configuration andarrangement of elements, it should be noted that this is simply alogical depiction, and the components and functionality of system 10 maybe combined, separated and distributed as appropriate both logically andphysically. Also, the functionality of system 10 may be provided by anysuitable collection and arrangement of components.

FIGS. 2A-2B illustrate endpoints, indicated generally at 50 and 70, thatuse cameras and multiple view display devices to concurrently providelocal participants with perspective-dependent views of remoteparticipants. As used throughout this disclosure, “local” and “remote”are used as relational terms to identify, from the perspective of a“local” endpoint, the interactions between and operations andfunctionality within multiple different endpoints participating in avideo conference. Accordingly, the terms “local” and “remote” may beswitched when the perspective is that of the other endpoint.

FIG. 2A illustrates an example of a setup that may be provided atendpoint 50. In particular embodiments, endpoint 50 is one of endpoints14. As illustrated, endpoint 50 includes a table 52, three participants54, three displays 56, and three camera clusters 58. While notillustrated, endpoint 50 may also include any suitable number ofmicrophones, speakers, data input devices, data output devices, and/orany other suitable equipment to be used during or in conjunction with avideo conference.

As illustrated, participants 54 a, 54 b, 54 c are positioned around oneside of table 52. On the other side of table 52 sits three displays 56d, 56 e, 56 f, and one of camera clusters 58d, 58e, 58f is positionedabove each display 56 d, 56 e, 56 f. In the illustrated embodiment, eachcamera cluster 58 contains three cameras, with one camera pointed in thedirection of each of the local participants 54 a, 54 b, 54 c. Whileendpoint 50 is shown having this particular configuration, it is to beunderstood that any suitable configuration may be employed at endpoint50 in order to facilitate a desired video conference betweenparticipants at endpoint 50 and participants at a remote endpoint 14. Asan example, camera clusters 58 may be positioned below or behinddisplays 56. Additionally, endpoint 50 may include any suitable numberof participants 54, displays 56, and camera clusters 58.

In the illustrated embodiment, each display 56 d, 56 e, 56 f shows oneof the remote participants 54 d, 54 e, 54 f. Display 56 d shows theimage of remote participant 54 d; display 56 e shows the image of remoteparticipant 54 e; and display 56 f shows the image of remote participant54 f. These remote participants may be participating in the videoconference through a remote endpoint 70, as is described below withrespect to FIG. 2B. Using traditional methods, each local participant 54a, 54 b, 54 c would see the same image of each remote participant 54.For example, when three different individuals look at a traditionaltelevision screen or computer monitor, each individual sees the sametwo-dimensional image as the other two individuals. However, whenmultiple individuals see the same image, they may be unable todistinguish perspective-dependent non-verbal clues provided by theimage. For example, remote participant 54 may point at one of the threelocal participants 54 a, 54 b, 54 c to indicate to whom he is speaking.If the three local participants 54 a, 54 b, 54 c view the sametwo-dimensional image of the remote participant 54, it may be difficultto determine which of the local participants 54 has been selected by theremote participant 54 because the local participants 54 would not easilyunderstand the non-verbal clue provided by the remote participant 54.

However, displays 56 are configured to provide multipleperspective-dependent views to local participants 54. As an example,consider display 56 e, which shows an image of remote participant 54 e.In the illustrated embodiment, display 56 e concurrently displays threedifferent perspective-dependent views of remote participant 54 e. Localparticipant 54 a sees view A; local participant 54 b sees view B; andparticipant 54 c sees view C. Views A, B, and C all show differentperspective-dependent views of remote participant 54 e. View A may showan image of remote participant 54 e from the left of remote participant54 e. Likewise, views B and C may show an image of remote participant 54e from the center and right, respectively, of remote participant 54 e.In particular embodiments, view A shows the image of remote participant54 e that would be seen from a camera placed substantially near theimage of local participant 54 a that is presented to remote participant54 e. Accordingly, when remote participant 54 e looks at the displayedimage of local participant 54 a, it appears (to local participant 54 a)as if remote participant 54 e were looking directly at local participant54 a. Concurrently, and by similar techniques, views B and C (shown toparticipants 54 b and 54 c, respectively) may see an image of remoteparticipant 54 e that indicated that remote participant 54 e was lookingat local participant 54 a. In certain embodiments, displays 56 createmultiple perspective-dependent views using lenticular lenses. Forexample, FIG. 3 illustrates a multiple view display device that employslenticular lenses to provide different views to different participants.In some embodiments, displays 56 create multiple perspective-dependentviews using barrier technology. In barrier technology, a physicalchannel (e.g., a grate or slat) guides light in a particular direction.For example, barrier technology may be used in privacy screens placed onlaptops to restrict the angles at which the laptop screen can be viewed.

Camera clusters 58 generate video streams conveying the image of localparticipants 54 a, 54 b, 54 c for transmission to remote participants 54d, 54 e, 54 f. These video streams may be generated in a substantiallysimilar way as is described below in FIG. 2B with respect to remoteendpoint 70. Moreover, the video streams may be displayed by remotedisplays 58 in a substantially similar way to that previously describedfor local displays 56 d, 56 e, 56 f.

FIG. 2B illustrates an example of a setup that may be provided at theremote endpoint described above, indicated generally at 70. Inparticular embodiments, endpoint 70 is one of endpoints 14 a, 14 b incommunication system 10. As illustrated, endpoint 70 includes a table72, participants 54 d, 54 e, and 54 f, displays 56, and camera clusters58.

In the illustrated embodiment, three participants 54 d, 54 e, 54 f localto endpoint 70 sit on one side of table 72 while three displays 56 a, 56b, and 56 c are positioned on the other side of table 72. Each display56 a, 56 b, and 56 c shows an image of a corresponding participant 54remote to endpoint 70. These displays 56 a, 56 b, and 56 c may besubstantially similar to displays 56 d, 56 e, 56 f at endpoint 50. Thesedisplayed participants may be the participants 54 a, 54 b, 54 cdescribed above as participating in a video conference through endpoint50. Above each display 56 is positioned a corresponding camera cluster58. While endpoint 70 is shown having this particular configuration, itis to be understood that any suitable configuration may be employed atendpoint 70 in order to facilitate a desired video conference betweenparticipants at endpoint 70 and a remote endpoint 14 (which, in theillustrated embodiment, is endpoint 50). As an example, camera clusters58 may be positioned below or behind displays 56. Additionally, endpoint70 may include any suitable number of participants 54, displays 56, andcamera clusters 58.

As illustrated, each camera cluster 58 a, 58 b, 58 c includes threecameras that are each able to generate a video stream. Accordingly, withthe illustrated configuration, endpoint 70 includes nine cameras. Inparticular embodiments, fewer cameras are used and certain video streamsor portions of a video stream are synthesized using a mathematicalmodel. In other embodiments, more cameras are used to create multiplethree dimensional images of participants 54. In some embodiments, thecameras in camera clusters 58 are cameras 28.

In each camera cluster 58, one camera is positioned to capture the imageof one of the local participants 54 d, 54 e, 54 f. Accordingly, eachlocal participant 54 d, 54 e, 54 f has three cameras, one from eachcamera cluster 58, directed towards him or her. For example, threedifferent video streams containing an image of participant 54 e may begenerated by the middle camera in camera cluster 58 a, the middle camerain camera cluster 58 b, and the middle camera in camera cluster 58 c, asis illustrated by the shaded cameras. The three cameras corresponding tolocal participant 54 e will each generate an image of participant 54 efrom a different angle. Likewise, three video streams may be created toinclude different perspectives of participant 54 d, and three videostreams may be created to include different perspectives of participant54 f.

The images generated by the cameras in camera clusters 58 a, 58 b, 58 cmay be transmitted to remote endpoint 50. After correlating each videostream to its corresponding participant 54 d, 54 e, 54 f, the videostreams may be concurrently displayed on displays 56 d, 56 e, 56 f asdescribed above. Taking the three streams corresponding to participant54 e as an example, one of these three video streams may provide view Ato participant 54 a, as is illustrated in both FIGS. 2A & 2B. Likewise,a second video stream may provide view B to participant 54 b, and thethird video stream may provide view C to participant 54 c.

In operation, camera clusters 58 a, 58 b, 58 c at endpoint 70 maygenerate nine video streams containing different perspective-dependentviews of participants 54 d, 54 e, 45 f. For example, camera cluster 58 amay generate a first video stream corresponding to participant 54 d, asecond video stream corresponding to participant 54 e, and a third videostream corresponding to participant 54 f. Likewise, camera clusters 58b, 58 c may each generate three video streams—one corresponding toparticipant 54 d, one corresponding to participant 54 e, and onecorresponding to participant 54 f. Accordingly, in this particularembodiment, endpoint 70 will generate nine total video streams,including three perspective dependent views of participant 54 d, threeperspective dependent views of participant 54 e, and three perspectivedependent views of participant 54 f. These video streams may be marked,organized, and/or compressed before they are transmitted to endpoint 50.

After receiving these video streams, endpoint 50 may identify the threevideo streams corresponding to participant 54 d, the three video streamscorresponding to participant 54 e, and the three video streamscorresponding to participant 54 f. Endpoint 50 may then concurrentlydisplay the video streams corresponding to each particular participant54 d, 54 e, 54 f on that participant's corresponding display 56 d, 56 e,56 f. For example, display 56 e may concurrently display three videostreams corresponding to participant 54 e. These three video streams maybe displayed so that participant 54 a views participant 54 e from afirst perspective, participant 54 b views participant 54 e from a secondperspective, and participant 54 c views participant 54 e from a thirdperspective. These views may correspond to views A, B, and C, asillustrated in FIGS. 2A & 2B. Because display 56 e may provide multipleperspective-dependent views of participant 54 e to local participants 54a, 54 b, 54 c, those local participants may be able to more easilyinterpret non-verbal cues, such as eye gaze and pointing, given byparticipant 54 e during a video conference. Displays 56 d and 56 f mayoperate similarly to display 56 e. Additionally, while the transmissionof video streams from endpoint 50 to endpoint 70 has been described indetail, it is understood that the transmission of video streams fromendpoint 70 to endpoint 50 may include similar methods.

Particular embodiments of endpoints 50, 70 and their constituentcomponents have been described and are not intended to be all inclusive.While these endpoints 50, 70 are depicted as containing a certainconfiguration and arrangement of elements, components, devices, etc., itshould be noted that this is simply an example, and the components andfunctionality of each endpoint 50, 70 may be combined, separated anddistributed as appropriate both logically and physically. In particularembodiments, endpoint 50 and endpoint 70 have substantially similarconfigurations and include substantially similar functionality. In otherembodiments, each of endpoints 50, 70 may include any suitableconfiguration, which may be the same as, different than, or similar tothe configuration of another endpoint participating in a videoconference. Moreover, while endpoints 50, 70 are described as eachincluding three participants 54, three displays 56, and three cameraclusters 58, endpoints 50, 70 may include any suitable number ofparticipant 54, displays 56, and camera clusters 58. In addition, thenumber of participant 54, displays 56, and/or camera clusters 58 maydiffer from the number of one or more of the other described aspects ofendpoint 50, 70. Any suitable number of video streams may be generatedto convey the image of participants 54 during a video conference.

FIGS. 3A-3B illustrate a multiple view display device, indicatedgenerally at 80, that employs lenticular lenses to provide differentviews to different participants. In the illustrated embodiment, threedifferent views 82 a, 82 b, 82 c are provided to three participants 84a, 84 b, 84 c. For example, these different views may correspond toviews A, B, and C of participant 54 e that are provided to participants54 a, 54 b, 54 c during a video conference. In particular embodiments,multiple view display device 80 is one of displays 56.

FIG. 3A shows a view from above multiple view display device 80,illustrating the different views 82 a, 82 b, 82 c provided toparticipants 84 a, 84 b, 84 c. As illustrated, multiple view displaydevice 80 includes a display controller 86 that has three displaydrivers 88 a, 88 b, 88 c, a screen 90, and a lenticular lens array 92that includes lenticular lenses 94.

Display controller 86 receives data corresponding to images to bedisplayed by multiple view display 80 and drives the illumination ofpixels 96 on screen 90. In the illustrated embodiment, displaycontroller 86 includes three display drivers 88 a, 88 b, 88 c. Displaydriver 88 a may be responsible for controlling a first portion of screen90 corresponding to a first displayed image. Display driver 88 b may beresponsible for controlling a second portion of screen 90 correspondingto a second displayed image. Display driver 88 c may be responsible forcontrolling a third portion of screen 90 corresponding to a thirddisplayed image.

A partial row of pixels 96 on screen 90 is illustrated. Pixels 96 may bedivided into three portions, or sets. In the illustrated row of pixels96, set A includes pixels 96 a ₁, 96 a ₂, set B includes pixels 96 b ₁,96 b ₂, and set C includes pixels 96 c ₁, 96 c ₂. Each set of pixels 96a, 96 b, 96 c may correspond to a different image to be displayed by amultiple view display 80. In particular embodiments, these differentimages are different perspective-dependent views of a particularparticipant 54 or participants 54 participating in a video conference.In other embodiments, any suitable images may be simultaneouslydisplayed on multiple view display device 80. As used herein, “image” ismeant to broadly encompass any visual data or information. As anexample, an image may be a still image. As another example, may be theresult of displaying a video stream.

Lenticular lens array 92 may be placed adjacent to screen 90 and mayinclude a plurality of lenticular lenses 94. In the illustratedembodiment, lenticular lens 94 is shown from a top-view. As is partiallyillustrated, other lenticular lenses 94 may be placed next to lenticularlens 94 to form lenticular lens array 92. Each lenticular lens 94 may beshaped similar to a cylinder cut in half along its diameter.Accordingly, looking at a single row of screen 90, lenticular lens 94appears to be a semi-circle with a diameter substantially equal to thewidth of three pixels 96. While lenticular lens array 92 is illustratedas having lenticular lens 94 extend vertically on screen 90, lenticularlens array 92 may incorporate lenticular lenses 94 extendinghorizontally, diagonally, or in any other suitable manner. In particularembodiments, lenticular lens 94 is substantially semicircular. In someembodiments, lenticular lens 94 is a smaller or larger arc of acylinder. In other embodiments, lenticular lens 94 may take a variety ofdifferent shapes, and two particular examples are described with respectto FIGS. 4A & 4B.

Generally, lenticular lens 94 focuses the light generated by pixels 96to provide a plurality of views. As illustrated, lenticular lens 94focuses the light generated by: pixel 96 a ₂ into view 82 a seen byparticipant 84 a; pixel 96 b ₁ into view 82 b seen by participant 84 b;and pixel 96 c ₁ into view 82 c seen by participant 84 c. While notillustrated, other lenticular lenses 94 in lenticular lens array 92 mayfocus pixels 96 in the different pixel groups 96 a, 96 b, 96 c in asimilar manner. By focusing light in this way, pixel group 96 a (withview 82 a) may be seen by participant 84 a, but not by participant 84 bor participant 84 c. Similarly, pixel group 96 b (with view 82 b) may beseen by participant 84 b, but not by participant 84 a or participant 84c, and pixel group 96 c (with view 82 c) may be seen by participant 84c, but not by participant 84 a or participant 84 b. Accordingly, in theillustrated embodiment of multiple view display device 80, a first imagemay be displayed using pixels 96 in pixel group 96 a, a second image maybe concurrently displayed using pixels 96 in pixel group 96 b, and athird image may be concurrently displayed using pixels 96 in pixel group96 c. These three images may then be seen by a respective one ofparticipants 84 a, 84 b, 84 c.

In an example operation, display controller 86 may receive a pluralityof video streams to be displayed on multiple view display device 80.Display controller 86 may identify the received video stream(s) thatcorrespond to view 82 a, view 82 b, and view 82 c. Then, displaycontroller 86 may send the video stream corresponding to view 82 a todisplay driver 88 a, the video stream corresponding to view 82 b todisplay driver 88 b, and the video stream corresponding to view 82 c todisplay driver 88 c. Each display driver 88 a, 88 b, 88 c may controlthe operation of the corresponding set of pixels 96 a, 98 b, 98 c. Forexample, display driver 88 a may display a first video stream on pixelgroup 96 a, display driver 88 b may concurrently display a second videostream on pixel group 96 b, and display driver 88 c may concurrentlydisplay a third video stream on pixel group 96 c. The light emitted byeach of these pixel groups 96 a, 96 b, 96 c may be focused into adifferent view 82 a, 82 b, 82 c. These views 82 a, 82 b, 82 c may beseen by corresponding participants 84 a, 84 b, 84 c. Accordingly, afirst participant 84 a may see a first video stream because it isdisplayed with pixel group 96 a, while second and third participants 84b, 84 c see second and third video streams because they are displayedwith pixel groups 96 b, 96 c, respectively.

In particular embodiments, multiple view display device 80 displaysmultiple video streams. In other embodiments, multiple view displaydevice 80 displays multiple still images. In certain embodiments,multiple view display device 80 displays any suitable number of videostreams and/or still images. For example, in one embodiment, multipleview display device 80 may be configured to concurrently display twoimages—one video image and one still image. In addition, while multipleview display device 80 has been described in conjunction with acommunications system and endpoints that support a video conference, itis to be understood that multiple view display device 80 may be used ina variety of different applications.

FIG. 3B illustrates an expanded view of a portion of screen 90. Asillustrated, screen 90 is shown from a horizontal angle. In particularembodiments, the illustrated portion of screen 90 represents a smallpercentage of screen 90 as it would be seen by participants 84.

As illustrated, screen 90 is comprised of a plurality of pixels 96 andconstituent sub-pixels arranged in a matrix having columns 98 and rows100. Columns 98 comprise a first set of columns (designated with A), asecond set of columns (designated with B), and a third set of columns(designated with C). Column 98 a may include pixels in pixel group 96 a,column 98 b may include pixels in pixel group 96 b, and column 98 c mayinclude pixels in pixel group 96 c. Accordingly, as described above,column 98 a (and other columns 98 belonging to set A) may display afirst image to a first participant 84 a, column 98 b (and other columns98 belonging to set B) may concurrently display a second image to asecond participant 84 b, and column 98 c (and other columns 98 belongingto set C) may display a third image to a third participant 84 c.

In the illustrated embodiment, rows 100 divide pixels 96 into blue, red,and green sub-pixels 102. A first row 104 may include blue sub-pixels102, a second row 106 may include red sub-pixels 102, and a third row108 may include green sub-pixels. This blue, red, green combination mayrepeat along rows 100. Three rows 104, 106, 108 of sub-pixels 102 may beemployed in order to generate the image created by one particular pixel96, and that one particular pixel may correspond to pixel group 96 a,pixel group 96 b, or pixel group 96 c.

Particular embodiments of a multiple view display device 80 have beendescribed and are not intended to be all inclusive. While the multipleview display device is depicted as containing a certain configurationand arrangement of elements, components, devices, etc., it should benoted that this is simply an example, and the components andfunctionality of the devices may be combined, separated and distributedas appropriate both logically and physically. For example, while screen90 is described and illustrated as being comprised of pixels 96 havingsub-pixels 102, it is understood that any suitable design may be used.

Moreover, while the illustrated embodiment shows columns 98 as verticalcomponents of the matrix and rows 100 as horizontal components of thematrix, it is to be understood that a “column” (as the term is usedherein) can have any linear orientation (i.e., vertical, horizontal, ordiagonal) and a “row” (as the term is used herein) can have any linearorientation. For example, rows 100 of blue, red, and green subpixels 102may each be oriented vertically, while columns 98 of pixel groups 96 a,96 b, and 96 c are oriented horizontally. Also, a “pixel” may be anysuitable component, device, or element that emits light. In particularembodiments, screen 90 is a plasma display device, which includes amatrix of rows and columns. Finally, the functionality of the multipleview display devices may be provided by any suitable collection andarrangement of components.

FIGS. 4A-4B show example lenticular lens designs for use in multipleview display devices. In particular embodiments, these multiple viewdisplay devices are multiple view display device 80.

FIG. 4A illustrates a lenticular lens indicated generally at 120.Lenticular lens 120 may be lenticular lens 94 in lenticular lens array92. As illustrated, lenticular lens 120 includes three sub-lenses 122 a,122 b, 122 c. Sub-lenses 122 may be designed to focus light emitted by acorresponding pixel 124. For example, sub-lens 122 a may be configuredwith a specific curvature that will focus the light generated by pixel124 a on participant 126 a. In particular embodiments, the design ofsub-lens 122 a takes into account the distance 128 between participant126 a and sub-lens 122 a. The design of sub-lens 122 a may also takeinto account the distance between sub-lens 122 a and pixel 124 a.

Likewise, sub-lens 122 b may be designed so that the light emitted bypixel 124 b is properly focused for participant 126 b, and sub-lens 122c may be designed so that the light emitted by pixel 124 c is properlyfocused for participant 126 c. As illustrated, the distance between afirst participant 126 and a corresponding sub-lens 122 may differ fromthe distance between a second participant 126 and a correspondingsub-lens 122. As is also illustrated, the distance between a first pixel(e.g., pixel 124 a) and its corresponding sub-lens 122 will differ fromthe distance between another pixel (e.g., pixel 124 b) and itscorresponding sub-lens 122. In order to accommodate these differentoptical situations, sub-lenses 122 a, 122 b, 122 c may have differentcurvatures, thicknesses, or indices of refraction.

FIG. 4B illustrates a first lenticular lens, indicated generally at 130,located at the left of a screen 138 and a second lenticular lens,indicated generally at 132, located at the right of screen 138.Lenticular lens 130 and lenticular lens 132 may have differentstructures to accommodate the different optical situations encounteredby light emitted by pixels located on different ends of the same screen.

Lenticular lens 130 focuses light from pixels 134 a, 134 b, and 134 cwhile lenticular lens 132 focuses light generated by pixels 136 a, 136b, and 136 c. As illustrated, screen 138 can be viewed by threeparticipants 140: participant 140 a views screen 138 from the left,participant 140 b views screen 138 from the center, and participant 140c views screen 138 from the right. As can also been seen in the figure,the distance between a particular participants 140 and lenticular lens130 may be different from the distance between that participant 140 andlenticular lens 132. For example, the distance between participant 140 aand lenticular lens 130 is much less than the distance betweenparticipant 140 a and lenticular lens 132. As another example, thedistance between lenticular lens 130 and participant 140 c is muchgreater than the distance between lenticular lens 132 and participant140 c. Because of the differing distances between a given participant140 a, 140 c and a given lenticular lenses 130, 132, the shape oflenticular lens 130 may differ from the shape of lenticular lens 132. Byaltering the shape of different lenticular lenses, such as lenticularlens 130 and lenticular lens 132, a lenticular lens array may provide animproved image to participant by more accurately focusing the view seenby that participant. Similarly, other lenticular lenses (notillustrated) that are located between lenticular lens 130 and lenticularlens 132 may gradually incorporate the changes found between theselenticular lenses 130, 132. Each lenticular lens may be designed providethe most effective user experience that focuses the relevant pixels 134,136 for a particular participant 140.

Additionally, participants 140 may view additional displays 56 (notillustrated). For example, one display may include a second screenplaced to the left of screen 138 and a second display may include athird screen placed to the right of screen 138. These second and thirdscreens may include lenticular lens arrays which include lenticularlenses. The lenticular lenses within each lenticular lens arrays maydiffer from each other; and these lenticular lens arrays may differ fromone another. In particular embodiments, enhanced multiple view displaysare accomplished by altering the shape of each lenticular lens within alenticular lens array and each lenticular lens array associated witheach multiple view display shown to video conference participants.

Particular examples of lenticular lens designs and lenticular lens arraydesigns have been described and are not intended to be all inclusive.While the designs are described depicted as containing a certainconfiguration and including certain elements, it should be noted thatthese are simply examples.

FIG. 5 is a flowchart illustrating a method, indicated generally at 150,by which a first endpoint sends video streams to a second endpoint sothat the second endpoint may concurrently provide different localparticipants with perspective-dependent views of one or more remoteparticipants. As illustrated, method 150 shows the operations of anendpoint 152 and an endpoint 154 participating in a video conference. Inparticular embodiments, endpoint 152 and/or endpoint 154 may be endpoint14, endpoint 50, and/or endpoint 70.

At step 156, endpoint 152 generates video streams to be sent to endpoint154 during a video conference. For example, one or more camera clusters58 at endpoint 152 may generate a plurality of video streams that eachinclude an image of one or more participants involved in the videoconference through endpoint 152. Camera clusters 58 may include one ormore cameras, such as cameras 28. In particular embodiments, endpoint152 generates nine video streams that include threeperspective-dependent views of each of three participants. In otherembodiments, endpoint 152 generates a number of video streams equal tothe number of local participants 54 multiplied by the number of remoteparticipants 54. Accordingly, each remote participant 54 may receive hisor her own perspective-dependent view of each local participant 54. Inother embodiments, more or fewer video streams may be generated byendpoint 152.

After generating the video streams, endpoint 152 determines whether ornot to compress this data, in step 158. The determination may be basedon a variety of factors, for example, the bandwidth available for avideo conference, the degree to which related video streams may becompressed, and other suitable factors. If endpoint 152 decides tocompress one or more video streams, it compresses the determined videostreams in step 160; otherwise, method 150 proceeds to step 162. Forexample, endpoint 152 may compress the video streams corresponding todifferent perspective-dependent views of the same participant 54. Afterthis compression, multiple generated video streams may be sent as asingle video stream. In particular embodiments, different views of asingle participant 54 may be compressible because there may beredundancy in the different images. In certain embodiments, endpoint 152may use any suitable techniques to compress or reduce the bandwidthrequirements of the generated video streams. At step 162, endpoint 152transmits the video stream(s) to endpoint 154.

At step 164, endpoint 154 receives the video streams from the endpoint152. Endpoint 154 may then determine whether or not endpoint 152compressed the received video stream data, in step 166. In particularembodiments, endpoint 154 analyzes information carried by the receiveddata in order to determine whether or not the video streams werecompressed. In other embodiments, endpoint 154 determine that endpoint152 compressed the video stream(s) based upon pre-established parametersfor the video conference. If endpoint 152 compressed the video streamdata, endpoint 154 decompresses the video stream data in step 168;otherwise, endpoint 154 skips step 168.

At step 170, endpoint 154 identifies all video streams containingperspective dependent views of a particular participant 54 at endpoint152. In particular embodiments, endpoint 154 may select a firstparticipant 54 and may identify all video streams corresponding to thatparticipant 54. Once these video streams have been identified, endpoint154 may concurrently display the identified streams on a multiple viewdisplay, in step 172. In particular embodiments, different video streamscontaining images of the particular participant 54 are displayed todifferent participants at endpoint 154. In particular embodiments, amultiple view display is multiple view display device 80. At step 174,endpoint 154 determines whether or not all video streams have beendisplayed. If not, method 150 proceeds to step 176 where endpoint 154identifies the next participant 54. Then, endpoint 154 identifies thevideo streams containing views of that participant 54, in step 170. Whenendpoint 154 determines, in step 174, that all video streams aredisplayed, method 150 ends.

The method described with respect to FIG. 5 is merely illustrative, andit is understood that the manner of operation and devices indicated asperforming the operations may be modified in any appropriate manner.While the method describes particular steps performed in a specificorder, it should be understood that this is merely a logical descriptionand various steps may be performed concurrently and/or in anyappropriate order. For example, endpoint 154 is described as identifyingthe video streams corresponding to a particular participant 54 anddisplaying those streams before moving to the next participant 54;however, it is to be understood that, in many embodiments, thisidentification and display is processed in parallel. In particularembodiments, a similar operation occurs for the generation,transmission, and display of video streams from endpoint 154 to endpoint152. Moreover, communications system 10 contemplates any suitablecollection and arrangement of elements performing some, all, or none ofthese steps.

FIG. 6 is a flowchart illustrating a method, indicated generally at 180,by which a multiple view display device employing a lenticular lensarray may provide different views to participants.

At step 182, multiple view display device 80 receives three videostreams. In particular embodiments, each video stream corresponds to adifferent perspective-dependent view of a remote participant that isinvolved in a video conference. In other embodiments, rather thanreceiving video streams, multiple view display device 80 receives datacontaining a still image. At step 184, a display controller (e.g.,display controller 86) identifies a left stream, a center stream, and aright stream. In particular embodiments, the left stream is displayed toa participant 84 on the left side of multiple view display device 80. Inother embodiments, any suitable method may be used to determine whichreceived stream should be displayed at which display location.

At step 186, display driver 88 a displays the left stream on the firstset of pixels. For example, the first set of pixels may correspond withpixel group 96 a, which includes, for example, pixels 96 a ₁ and 96 a ₂.The image generated by these pixels may be focused by lenticular lensesin a lenticular lens array (e.g., lenticular lenses 94 in lenticularlens array 92) in order to provide a first view to a first participant(e.g., view 82 a to participant 84 a). At step 188, display driver 88 bdisplays the center video stream on a second set of pixels. The centerstream may include a second image different than the image displayed bythe left stream. This image may be a different perspective of the sameremote participant participating in a video conference. In particularembodiments, the second set of pixels corresponds to pixel group 96 b,which includes, for example, pixels 96 b ₁ and 96 b ₂. The imagegenerated by these pixels may be focused by lenticular lenses in alenticular lens array (e.g., lenticular lenses 94 in lenticular lensarray 92) in order to provide a second view to a second participant(e.g., view 82 b to participant 84 b). At step 190, display driver 88 cdisplays the right stream on a third set of pixels. The right stream mayinclude a third image different than the image displayed by the leftstream and different than the image displayed by the center stream. Thisimage may be a different perspective of the same remote participant. Inparticular embodiments, the third set of pixels corresponds to pixelgroup 96 c, which includes, for example, pixels 96 c ₁ and 96 c ₂. Theimage generated by these pixels may be focused by lenticular lenses in alenticular lens array (e.g., lenticular lenses 94 in lenticular lensarray 92) in order to provide a third view to a third participant (e.g.,view 82 c to participant 84 c). By displaying different streams ondifferent sets of pixels and focusing the light generated by thosepixels with a lenticular lens array, a multiple view display device mayconcurrently provide different perspective-dependent images to differentparticipants. After step 190, method 180 ends.

The method described with respect to FIG. 6 is merely illustrative, andit is understood that the manner of operation and devices indicated asperforming the operations may be modified in any appropriate manner.While the method describes particular steps performed in a specificorder, it should be understood that communications system 10contemplates any suitable collection and arrangement of elementsperforming some, all, or none of these steps in any operable order.

Although the present invention has been described in severalembodiments, a myriad of changes and modifications may be suggested toone skilled in the art, and it is intended that the present inventionencompass such changes and modifications as fall within the presentappended claims.

1. A method comprising: receiving a plurality of video streams at a local endpoint from a remote endpoint, the local endpoint and the remote endpoint participating in a video conference involving a plurality of participants; identifying a first video stream in the plurality of video streams, the first video stream providing a first view of a remote participant at the remote endpoint; identifying a second video stream in the plurality of video streams, the second video stream providing a second view of the remote participant, the first view and the second view showing images of the remote participant taken concurrently from different angles; displaying the first video stream on a display at a first viewing angle that only allows a local participant to see the first view of the remote participant from a first region; and displaying the second video stream on the display at a second viewing angle that only allows the local participant to see the second view of the remote participant from a second region, the second region different than the first region.
 2. The method of claim 1, wherein: the display comprises a lenticular lens array including a plurality of lenticular lenses; and each lenticular lens is configured to direct light emitted by a first column of pixels toward the first region and to direct light emitted by a second column of pixels toward the second region.
 3. The method of claim 1, further comprising: identifying a third video stream in the plurality of video streams, the third video stream providing a third view of the remote participant, wherein the first view, the second view, and the third view provide images of the remote participant taken concurrently from different angles; and displaying the third video stream on the display at a third viewing angle that only allows the local participant to see the third view of the remote participant from a third region, the third region different than the first region and different than the second region.
 4. The method of claim 3, wherein: the display comprises a lenticular lens array including a plurality of lenticular lenses; and each lenticular lens is configured to direct light emitted by a first column of pixels toward the first region, to direct light emitted by a second column of pixels toward the second region, and to direct light emitted by a third column of pixels toward the third region.
 5. The method of claim 1, wherein: the first viewing angle allows a first local participant in the first region to see the first view and prohibits a second local participant in the second region from seeing the first view; and the second viewing angle allows the second local participant to see the second view and prohibits the first local participant from seeing the second view.
 6. The method of claim 1, wherein the remote endpoint synthesizes the first video stream and the second video stream using a single camera and a mathematical model.
 7. The method of claim 1, wherein the display is able to display the first video stream and the second video stream using barrier technology.
 8. The method of claim 1, further comprising decompressing one or more of the plurality of video streams.
 9. An apparatus comprising: a network interface operable to receive a plurality of video streams at a local endpoint from a remote endpoint, the local endpoint and the remote endpoint participating in a video conference involving a plurality of participants; a controller operable to identify a first video stream in the plurality of video streams, the first video stream providing a first view of a remote participant at the remote endpoint, the controller further operable to identify a second video stream in the plurality of video streams, the second video stream providing a second view of the remote participant, the first view and the second view providing images of the remote participant taken concurrently from different angles; and a display operable to display the first video stream at a first viewing angle that only allows a local participant to see the first view of the remote participant from a first region, the display further operable to display the second video stream at a second viewing angle that only allows the local participant to see the second view of the remote participant from a second region, the second region different than the first region.
 10. The apparatus of claim 9, wherein: the display comprises a lenticular lens array including a plurality of lenticular lenses; and each lenticular lens is configured to direct light emitted by a first column of pixels toward the first region and to direct light emitted by a second column of pixels toward the second region.
 11. The apparatus of claim 9, wherein: the controller is further operable to identify a third video stream in the plurality of video streams, the third video stream providing a third view of the remote participant, wherein the first view, the second view, and the third view provide images of the remote participant taken concurrently from different angles; and the display is further operable to display the third video stream at a third viewing angle that only allows the local participant to see the third view of the remote participant from a third region, the third region different than the first region and different than the second region.
 12. The apparatus of Claim I1, wherein: the display comprises a lenticular lens array including a plurality of lenticular lenses; and each lenticular lens is configured to direct light emitted by a first column of pixels toward the first region, to direct light emitted by a second column of pixels toward the second region, and to direct light emitted by a third column of pixels toward the third region.
 13. The apparatus of claim 9, wherein: the first viewing angle allows a first local participant in the first region to see the first view and prohibits a second local participant in the second region from seeing the first view; and the second viewing angle allows the second local participant to see the second view and prohibits the first local participant from seeing the second view.
 14. The apparatus of claim 9, wherein the remote endpoint synthesizes the first video stream and the second video stream using a single camera and a mathematical model.
 15. The apparatus of claim 9, wherein the display is operable to display the first video stream and the second video stream using barrier technology.
 16. The apparatus of claim 9, wherein the controller is further operable to decompress one or more of the plurality of video streams.
 17. An apparatus comprising: means for receiving a plurality of video streams at a local endpoint from a remote endpoint, the local endpoint and the remote endpoint participating in a video conference involving a plurality of participants; means for identifying a first video stream in the plurality of video streams, the first video stream providing a first view of a remote participant at the remote endpoint; means for identifying a second video stream in the plurality of video streams, the second video stream providing a second view of the remote participant, the first view and the second view providing images of the remote participant taken concurrently from different angles; means for displaying the first video stream on a display at a first viewing angle that only allows a local participant to see the first view of the remote participant from a first region; and means for displaying the second video stream on the display at a second viewing angle that only allows the local participant to see the second view of the remote participant from a second region, the second region different than the first region.
 18. The apparatus of claim 17, wherein: the display comprises a lenticular lens array including a plurality of lenticular lenses; and each lenticular lens is configured to direct light emitted by a first column of pixels toward the first region and to direct light emitted by a second column of pixels toward the second region.
 19. The apparatus of claim 17, further comprising: means for identifying a third video stream in the plurality of video streams, the third video stream providing a third view of the remote participant, wherein the first view, the second view, and the third view provide images of the remote participant taken concurrently from different angles; and means for displaying the third video stream on the display at a third viewing angle that only allows the local participant to see the third view of the remote participant from a third region, the third region different than the first region and different than the second region.
 20. The apparatus of claim 17, wherein: the first viewing angle allows a first local participant in the first region to see the first view and prohibits a second local participant in the second region from seeing the first view; and the second viewing angle allows the second local participant to see the second view and prohibits the first local participant from seeing the second view. 